To address the difficulty of detecting a human hand against a complex background with a traditional camera, a fast, automatic method is proposed that accurately detects and tracks foreground human fingertips using a Kinect camera. The method first uses combined vision-based information to roughly extract the hand region; then, by taking advantage of depth information, it segments the bare hand so that it is not connected to the background. Subsequently, the fingertips of the bare hand are extracted using the minimum circle and the curvature relationship on the hand boundary. Finally, to improve detection accuracy, the fingertip positions are refined with a Kalman filter. The experimental results show that, compared with existing methods, the algorithm successfully tracks the 3D locations of fingertips under multiple hand poses with a much lower error rate.
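The final refinement step can be illustrated with a minimal sketch. The abstract does not specify the state model or noise parameters of the Kalman filter, so the code below assumes a simple random-walk model applied independently to each of the three fingertip coordinates. The names `ScalarKalman` and `FingertipSmoother` and the values of `q` and `r` are hypothetical, chosen only for illustration, and are not taken from the described method.

```python
from dataclasses import dataclass


@dataclass
class ScalarKalman:
    """Random-walk Kalman filter for one coordinate (assumed model)."""
    q: float = 1e-3  # process noise variance (hypothetical value)
    r: float = 1e-1  # measurement noise variance (hypothetical value)
    x: float = 0.0   # current state estimate
    p: float = 1.0   # current estimate variance
    initialized: bool = False

    def update(self, z: float) -> float:
        if not self.initialized:
            # Seed the state with the first measurement.
            self.x = z
            self.initialized = True
            return self.x
        self.p += self.q                # predict: uncertainty grows
        k = self.p / (self.p + self.r)  # Kalman gain
        self.x += k * (z - self.x)      # correct using the innovation
        self.p *= 1.0 - k               # posterior variance
        return self.x


class FingertipSmoother:
    """Smooths one fingertip's (x, y, z) position, one filter per axis."""

    def __init__(self) -> None:
        self.axes = [ScalarKalman() for _ in range(3)]

    def update(self, point):
        # point is a raw (x, y, z) detection for the current frame.
        return tuple(f.update(c) for f, c in zip(self.axes, point))
```

In use, one `FingertipSmoother` would be kept per tracked fingertip and fed the raw 3D detection from each frame. A larger `r` relative to `q` gives smoother but more sluggish tracks, and a constant-velocity model may suit fast hand motion better than the random-walk assumption made here.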